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BsiHKAI 

TGTGCTCTAAAGCCCCCATTCAAAATGCCATTTAACGCATTTGATACCTTCAAAGAA7LAA 
ACACGAGATTTCGGGGGTAAGTTTTACGGTAAATTGCGTAAACTATGGAAGTTTCTTTTT 
CALKPPFKMPFNAFDTFKEK 
Smal 

Xmal | BsmI 



ATTTTGAAACC C GGGAAGGAAGGAGTGAAGAATGCC GT AGGAGATTCGC TGGGGATCTTA 
61 + + + + + + 120 

T AAAAC TTT GGG CCCTTCCTTCC TC AC TTC TT AC GGC ATC CTC T AAGC GAC C C C T AGAAT 
ILKPGKEGVKNAVGDSLGIL 
TaqI 



C AAAGAAAACTC GATGGGAC C AAC GAGGAGGGAGATGCC ATTGAGC TGAGTGAGGAAGGA 

121 + + + + + + 180 

GTTTCTTTTGAGCTACCCTGGTTGCTCCTCCCTCTACGGTAACTCGACTCACTCCTTCCT 

QRKLDGTNEEGDAIELSEEG 

AlwNI 
PvuII 

StuI Bsgl PstI 



AGGCCTGTGCAGACATCCAGAGCCCGAGCCCCTGTGTGCGACTGCAGCTGCTGTGGCATC 
TCCGGACACGTCTGTAGGTCTCGGGCTCGGGGACACACGCTGACGTCGACGACACCGTAG 
RPVQTSRARAPVCDCSCCGI 

BspHI 



CCCAAGCGGTACATCATCGCTGTCATGAGTGGCCTGGGATTCTGCATTTCCTTTGGGATT 

241 + + + + + + 300 

GGGTTCGCCATGTAGTAGCGACAGTACTCACCGGACCCTAAGACGTAAAGGAAACCCTAA 

PKRYIIAVMSGLGFCISFGI 

MscI Xcml 



CGGTGCAACCTTGGAGTGGCCATTGTGGAAATGGTCAACAATAGCACTGTGTATGTGGAT 

301 + + + + + + 360 

GCCACGTTGGAACCTCACCGGTAACACCTTTACCAGTTGTTATCGTGACACATACACCTA 

RCNLGVAIVEMVNWSTVYVD 
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BsmBI 

BsaWI BamHI Bmrl EcoRI 



GGGAAAC C GGAAATC C AGAC AGC AC AGTTTAAC TGGGATCC AGAGAC GGTGGGAAGGGC G 

361 + + + + + + 420 

CCCTTTGGCCTTTAGGTCTGTCGTGTCAAATTGACCCTAGGTCTCTGCCACCCTTCCCGC 

GKPEIQTAQFNWDPETVGRA 

Maelll 

Ncol Tsp45I 



AATTCTCTTATCCATGGATCTTTTTTCTGGGGTTATATTGTGACACAAATTCCCGGTGGC 

421 + + + + + + 480 

TTAAGAGAATAGGTACCTAGAAAAAAGACCCCAATATAACACTGTGTTTAAGGGCCACCG 

NSLIHGSFFWGYIVTQIPGG 

Aatll 
BsaHI 



TTCATTTCAAACAAGTTTGCTGCTAACAGGGTCTTTGGAGCTGCCATCTTCTTGACGTCA 

481 + + + + + + 540 

AAGTAAAGTTTGTTCAAACGACGATTGTCCCAGAAACCTCGACGGTAGAAGAACTGCAGT 

FISNKFAANRVFGAAIFLTS 

XmnI Sac I I 



Pcil 



BstUI 



ACCCTGAACATGTTCATCCCTTCCGCGGCCAGGGTGCATTACGGCTGTGTCATGTGTGTG 

TGGGACTTGTACAAGTAGGGAAGGCGCCGGTCCCACGTAATGCCGACACAGTACACACAC 

TLNMFI PSAARVHYGCVMCV 

Maelll 
AlwNI Tsp45I 



AGG ATTTTGC AGGG TC TGGTGGAGGGT GTG AC C T AC C C AGC C TGC C ACGGG ATGTGGAGT 

601 + + + + + + 660 

TCCTAAAACGTCCCAGACCACCTCCCACACTGGATGGGTCGGACGGTGCCCTACACCTCA 

RILQGLVEGVTYPACHGMWS 

Bpml 

AAGTGGGCACCTCCCCTGGAGAGAAGTCGTCTAGCCACAACCTCTTTTTGTGGTTCCTAT 

661 + + + + + + 720 

TTCACCCGTGGAGGGGACCTCTCTTCAGCAGATCGGTGTTGGAGAAAAACACCAAGGATA 

KWAPPLERSRLATTSFCGSY 



Fig. 1 cont'd 
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BstAPI TatI 



GCCGGGGCAGTCGTTGCTATGCCCCTTGCAGGAGTATTGGTGCAGTACATTGGCTGGGCC 

721 + + + + + + 780 

CGGCCCCGTCAGCAACGATACGGGGAACGTCCTCATAACCACGTCATGTAACCGACCCGG 

AGAVVAMPLAGVLVQYIGWA 



Bsgl 



Pcil 



'"is 



TCTGCCTTTTATATTTACGGGATGTTTGGAATTATTTGGTACATGTTTTGGCTGCTGCTG 
781 + + + + + + 840 

AGACGGAAAATATAAATGC CCTAC AAAC C TTAATAAAC C ATGT AC AAAAC CGACGACGAC 
SAFYIYGMFGI IWYMFWLLL 



PstI 



HphI 



CAGGCTTATGAGTGTCCAGCAGTTCACC CAACAATATC CAATGAAGAACGGACCTACATA 

841 + + + + + + 900 

GTCCGAATACTCACAGGTCGTCAAGTGGGTTGTTATAGGTTACTTCTTGCCTGGATGTAT 

QAYECPAVHPTI SNEERTYI 



Bbel 
Haell 
Hhal 
Sfol 
BsaHI 
HinPlI 
Narl 
KasI 



MscI 
Bgll I 



Ncol 



GAGACAAGTATAGGAGAAGGCGCCAACTTGGCCAGTCTGAGCAAATTCAACACACCATGG 
CTCTGTTCATATCCTCTTCCGCGGTTGAACCGGTCAGACTCGTTTAAGTTGTGTGGTACC 
ETSIGEGANLASLSKFNTPW 

BstXI 



AGAAGGTTTTTCACATCCTTGCCTGTCTATGCCATTATTGTGGCAAACTTTTGTAGAAGC 

961 + + + + + + 1020 

TC TTC C AAAAAGTGTAGG AAC GGAC AGATAC GGT AATAAC AC C GTT TGAAAAC ATC TTCG 

RRFFTSLPVYAI IVANFCRS 



Fig. 1 cont'd 
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TGGACCTTCTATTTGCTCTTAATAAGTCAGCCTGCTTACTTTGAAGAGGTCTTTGGGTTT 

1021 + + + + + + 1080 

ACC TGGAAGATAAACGAGAATTATTC AGTC GGAC GAATGAAAC TTCTC C AGAAAC CC AAA 

WTFYLLLISQPAYFEEVFGF 

Bsal PvuII Drain BsaBI HphI Acc65I 



GCAATAAGTAAGGTGGGTCTCTTGTCAGCTGTCCCACACATGGTGATGACAATCGTGGTA 

1081 + + + + + + 1140 

CGTTATTCATTCCACCCAGAGAACAGTCGACAGGGTGTGTACCACTACTGTTAGCACCAT 

AISKVGLLSAVPHMVMTIVV 

Kpnl 



CCCATTGGAGGACAACTGGCTGATTATTTAAGAAGCCGAAAGATTTTGACCACAACTGCT 

1141 + + + + + + 1200 

GGGTAACCTCCTGTTGACCGACTAATAAATTCTTCGGCTTTCTAAAACTGGTGTTGACGA 

PIGGQLADYLRSRKILTTTA 

BspHI 



GTCAGAAAGATCATGAATTGTGGAGGCTTTGGCATGGAGGCAACCTTGCTCCTGGTGGTT 

1201 + + + + + + 1260 

CAGTCTTTCTAGTACTTAACACCTCCGAAACCGTACCTCCGTTGGAACGAGGACCACCAA 

VRKIMNCGGFGMEATLLLVV 

BstXI 



GGGTTTTCCCATACCAAAGGAGTGGCTATCTCCTTCCTGGTGCTTGCTGTAGGATTTAGT 

1261 + + + + + + 1320 

CCCAAAAGGGTATGGTTTCCTCACCGATAGAGGAAGGACCACGAACGACATCCTAAATCA 

GF3HTKGVAI SFLVLAVGFS 

GGCTTTGCAATTTCAGGTTTCAATGTCAACCACCTGGACATTGCTCCACGATATGCCAGC 

1321 + + + + + + 1380 

CCGAAACGTTAAAGTCCAAAGTTACAGTTGGTGGACCTGTAACGAGGTGCTATACGGTCG 

GFAI SGFNVNHLDIAPRYAS 

Xcml 



ATCCTCATGGGGATCTCAAATGGCGTGGGAACCCTCTCTGGAATGGTTTGTCCCCTCATT 

1381 + + + + + + 1440 

TAGGAGTACCCCTAGAGTTTACCGCACCCTTGGGAGAGACCTTACCAAACAGGGGAGTAA 

ILMGI SNGVGTLSGMVCPLI 



Fig. 1 cont'd 
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Smal 



Xmal 



XmnI 



GTTGGTGCAATGACAAAGCACAAGACCCGGGAAGAATGGCAGAATGTGTTCCTCATAGCA 

1441 + + + + + + 1500 

CAACCACGTTACTGTTTCGTGTTCTGGGCCCTTCTTACCGTCTTACACAAGGAGTATCGT 

VGAMTKHKTREEWQNVFLIA 

BsiHKAI Mlyl 



ApaLl 



Plel 



GC C C TGGTGC AC TAC AGTGG AGTC ATC TTC T AC GGGGTC TTTGCTTCTGGGGAAAAAC AG 

1501 + + + + + + 1560 

CGGGACCACGTGATGTCACCTCAGTAGAAGATGCCCCAGAAACGAAGACCCCTTTTTGTC 

ALVHYSGVIFYGVFASGEKQ 

Bmrl BplI 



GACTGGGCTGATCCAGAGAATCTCTCTGAGGAGAAATGTGGAATCATTGACCAAGATGAA 

1561 + + + + + + 1620 

CTGACCCGACTAGGTCTCTTAGAGAGACTCCTCTTTACACCTTAGTAACTGGTTCTACTT 

DWADPENLSEEKCGI IDQDE 

BssSl 



TTAGCCGAGGAAACAGAACTCAACCACGAGGCTTTCGTAAGTCCCAGAAAGAAGATGTCT 

1621 + + + + + + 1680 

AATC GGC TC CTTTGTCTTGAGTTGGTGC TC C GAAAGC ATTC AGGGTCTTTC TTC TAC AGA 

LAEETELNHEAFVSPRKKMS 

Bbsl Ecil 



TATGGAGCC AC CAC CC AGAATTGTGAGGTCC AGAAGACGGATCGGAGACAACAGAGAGAA 

1681 + + + + + + 1740 

ATACCTCGGTGGTGGGTCTTAACACTCCAGGTCTTCTGCCTAGCCTCTGTTGTCTCTCTT 

YGATTQNCEVQKTDRRQQRE 
TaqI 

TCCGCC TTC GAGGGGGAGGAGC C ATTATCC TAC C AGAATGAAGAGGAC TTTTC AGAAAC A 

1741 + + + + + + 1800 

AGGCGGAAGCTCCCCCTCCTCGGTAATAGGATGGTCTTACTTCTCCTGAAAAGTCTTTGT 

SAFEGEEPLSYQNEEDFSET 



Fig. 1 cont'd 
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BbvCI 
BpulOI 



TCTTAACGTGCATCTTCCCCTCAGCTTACAACCAGAAGTCTCCACACCCATTGCTTTTCC 

1801 + + + + + + 1860 

AGAATTGCACGTAGAAGGGGAGTCGAATGTTGGTCTTCAGAGGTGTGGGTAACGAAAAGG 

S*RASSPQLTTRSLHTHCFS 

Pf 1MI 

CATACCTTGGCCTTCCAGGGGGCCAAATCACAGGAAAGGGGGAGACTAAATCAACAACAG 

1861 + + + + + + 1920 

GTATGGAACCGGAAGGTCCCCCGGTTTAGTGTCCTTTCCCCCTCTGATTTAGTTGTTGTC 

HTLAFQGAKSQERGRLNQQQ 

Bsal 



AGAAGAAAAATGCCTTCTTACAAAGATGGGCGTATGGATCTTGGTCTCAGTTAATTAGAT 

1921 + + + + + + 1980 

TC TTC TTTTTACGGAAGAATGTTTC TAC CC GCATAC CTAGAACCAGAGTCAATTAATCT A 

RRKMPSYKDGRMDLGLS *LD 

Bell Mfel 

AGTTGATCATATTTTTTTTGGGGGGGGCAATTGGGCATTGGCTGTTGAGCCTTCTCTCAA 

1981 + + + + + + 2040 

TC AAC TAGTAT AAAAAAAAC CC C CC CCGTTAACCCGTAACCGAC AACTC GGAAGAGAGTT 

S * SYFFWGGQLGIGC *AFSQ 

AAGAACAATTTATTCAGGAAGAAATGGCTAGAAGAATAAGGAGTGGCTTGTTGCTCAAAT 

2041 + + + + + + 2100 

TTCTTGTTAAATAAGTCCTTCTTTACCGATCTTCTTATTCCTCACCGAACAACGAGTTTA 

KNNLFRKKWLEE * GVACC SN 

Eco57I TatI Bpml 



AAACACTGAAGAAATCCCTCTTTGGTCTGGAGAAGAGTACATGGTGGTTGCCACCCCATC 

2101 + + + + + + 2160 

TTTGTGACTTCTTTAGGGAGAAACCAGACCTCTTCTCATGTACCACCAACGGTGGGGTAG 

KH*RNPSLVWRRVHGGCHPI 



Fig. 1 cont'd 
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ECOO109I 

EcoRV EcoNI PspOMI 



TCCAAGGATATCCATGTAGAGGACAATCTCTGCAACCTAATGAAGGGAATCACTCATGGG 

+ + + + + + 

AGGTTCC TATAGGTACATCTC CTGTTAGAGACGTTGGATTACTTC CCTTAGTGAGTAC CC 

SKDIHVEDWLCNLMKGITHG 



Apal 
EcoO109I 



GGCCCTTGGTTGTGCCAGGTGCTTTATGAACATTCTTATTTAACTCCCACACCCTAATAT 

+ + + + + + 

CCGGGAACCAACACGGTCCACGAAATACTTGTAAGAATAAATTGAGGGTGTGGGATTATA 

GPWLCQVLYEHSYLTPTP*Y 



AGTTATTGTACCCATTTTACAACTT^AGAACATTAAATGACTAGGTTGGCCCACCCAAGGT 

+ + + + + + 

TCAATAACATGGGTAAAATGTTGATTCTTGTAATTTACTGATCCAACCGGGTGGGTTCCA 

SYCTHFTTKNIK*LGWPTQG 

TGTCCTCTCAGAGCCAAAGCTGAGACTGGCAGATGACCAGGAGTTTTAGGAAGGAAGGAA 

+ 4- + + + + 

ACAGGAGAGTCTCGGTTTCGACTCTGACCGTCTACTGGTCCTCAAAATCCTTCCTTCCTT 

CPLRAKAETGR* PGVLGRKE 

GGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGGTTCAGTTGAGTGT 

+ + + + + + 

CCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCCAAGTCAACTCACA 

GRKEGRKEGRKEGRKGSVEC 

AGGGTCATTTTCAATGACAAAAACAAAAACTGGAATCAGTTGGTTTGTGGGTAATTCCAT 

+ + + + + + 

TCCCAGTAAAAGTTACTGTTTTTGTTTTTGACCTTAGTCAACCAAACACCCATTAAGGTA 

RVI FNDKNKNWNQLVCG* FH 

SphI 

GTTTGGTCAAGGGTGTGTGCATGCAAACGTGTATGTGCGTGTGTGTGTGTTTGTGTGTTT 

+ + + + + + 

CAAACCAGTTCCCACACACGTACGTTTGCACATACACGCACACACACACAAACACACAAA 

VWSRVCACKRVCACVCVCVF 

Fig. 1 cont'd 
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GNGTGTNAGNNNGNATNANAANAAAAN 

2581 + + 2607 

CNCACANTCNNNCNTANTNTTNTTTTN 



Fig. 1 cont'd 
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Fig. 2 A 
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Fig. 2B 
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Fig. 4 A 
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Fig. 4B 
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Fig. 5A, 5B, and 5C 
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Fig. 5D 
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